Syntactical Parsing for Ayta Abellen using PAWS generated Phrase Structure Rules
نویسنده
چکیده
Automated syntactic parsing of Philippine languages could be foundational to future machine translation systems. Rule based systems for Philippine languages have typically not reached a level of wide coverage of language phenomena. The syntax parsing system described here uses the PAWS (Parser and Writer of Syntax) expert system to generate phrase structure rules. After customized rules common to most Philippine languages were added in the process of bringing a training data set up to 100% parsing, the auto-generated phrase structure rules were able to produce a correct parse for 81% of sentences in two Ayta Abellen native authored running texts. The customizations made for the training set helped further the development of the PAWS expert system for use with Philippine languages. The 81% parsing rate is significant in that it represents a wide range of coverage for a rules-based system.
منابع مشابه
Motifs de graphe pour le calcul de dépendances syntaxiques complètes
This article describes a method to build syntactical dependencies starting from the phrase structure parsing process. The goal is to obtain all the information needed for a detailled semantical analysis. Interaction Grammars are used for parsing; the saturation of polarities which is the core of this formalism can be mapped to dependency relation. Formally, graph patterns are used to express th...
متن کاملمدل ترجمه عبارت-مرزی با استفاده از برچسبهای کمعمق نحوی
Phrase-boundary model for statistical machine translation labels the rules with classes of boundary words on the target side phrases of training corpus. In this paper, we extend the phrase-boundary model using shallow syntactic labels including POS tags and chunk labels. With the priority of chunk labels, the proposed model names non-terminals with shallow syntactic labels on the boundaries of ...
متن کاملPhrase Structure Parsing with Dependency Structure
In this paper we present a novel phrase structure parsing approach with the help of dependency structure. Different with existing phrase parsers, in our approach the inference procedure is guided by dependency structure, which makes the parsing procedure flexibly. The experimental results show our approach is much more accurate. With the help of golden dependency trees, F1 score of our parser a...
متن کاملPhrase Structure Annotation and Parsing for Learner English
There has been almost no work on phrase structure annotation and parsing specially designed for learner English despite the fact that they are useful for representing the structural characteristics of learner English. To address this problem, in this paper, we first propose a phrase structure annotation scheme for learner English and annotate two different learner corpora using it. Second, we s...
متن کاملAutomatic Generation of Composite Labels Using Part-of-Speech Tags for Parsing Korean
We propose a format of a binary phrase structure grammar with composite labels. The grammar adopts binary rules so that the dependency between two sub-trees can be represented in the label of the tree. The label of a tree is composed of two attributes, each of which is extracted from each sub-tree, so that it can represent the compositional information of the tree. The composite label is genera...
متن کامل